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To investigate what children learn during computer 
programming instruction, students attending a summer computer camp 
were asked to recall either single lines or chunks of computer 
programs from either coherent or scrambled programs. The 16 subjects, 
ages 12 to 17, were divided into three instructional groups: (1) 
beginners, who were taught to program in Applesoft BASIC; (2) 
intermediate, who were taught advanced concepts such as text files in 
Applesoft BASIC; and (3) advanced, who already had a background in 
BASIC and were taught PASCAL. The instruction involved programming 
syntax, debugging, planning, and use of a top-down programming 
structure. BASIC programs of 16 to 18 lines in length were arranged 
in a top-down structure or scrambled to separate lines which formed 
coherent procedural chunks. Students had two minutes to study the 
program and four minutes to write it down. Numbers of correct lines 
and chunks recalled were analyzed for each ability group and program 
version. All groups indicated greater recall of the coherent 
programs, especially the intermediate group. Increasing ability was 
associated with recall of the scrambled programs. Advanced 
programmers also commented that the scrambled programs did not make 
sense. (GDC) 
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Adolescents' Chunking of Computer Programs 
In the last decade, much has been made of the cognitive 
benefits of computer programming for children (e.g., Papert, 
1988). Students have been taught such programming languages as 
BASIC and LOGO with the goals of increasing their planning 
abilities, problem-solving skills, and metacogn i t i ve awareness 
of the problem-solving process itself. However, the research 
to date has produced contradictory results concerning the 
effects of learning to program on the development of children's 
complex cognitive skills (Pea & Kurland, 1984). At this point 
in time, it seems logical to focus research efforts on more 
basic questions. Specifically, what children do learn about 
the programming process and how do they organize that knowledge 
in memory? 

Researchers of the psychology of computer programming has 
employed an information processing perspective in their 
attempts to uncover knowledge about the cognitive processes of 
programming (e.g., Brooks, 1977; Mayer, 1981). Specifically, 
problem solving theories have served as a theoretical framework 
for the investigation of the cognitive processes required to 
construct workable computer programs (e.g., Newell 6c Simon, 
1972) . 

In focusing on computer programming, recent studies using 
adult subjects suggest that expert programmers have a "plan 
library" of basic computer programming "schemas," or recurrent 
functional chunks of programming code that are often used (Pea 
& Kur 1 and, 1983) . In an i nvest i gat i on of the knowl edge 
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structures of programmers with varying levels of expertise, 
McKeithen et al . <1?81) found that experts could recall more 
program lines and procedural chunks across trials than either 
intermediate or novice programmers. Further, experts were able 
to recall larger and more highly-organized chunks of a coherent 
program (using the ALGOL W programming language) than were 
i n termed i ate or nov i ce programmers. However , when students 
were given a scrambled version of the program, no differences 
were found across the groups. These results suggest that 
coherent programs were more meaningful for the experts, thus 
allowing them to apply what they knew about programming to the 
recall task. 

The information derived from the recall protocols of 
novices indicated that these individuals focused on surface 
structures of the program. That is, they used mnemonic 
techniques based on orthography to enhance recall. The 
intermediates recalled small chunks of programming code, 
typically those expressed in natural language terms (i.e., 
IF-THEN-ELSE and FOR-STEP-WHI LE-DO) . Expert programmers 
demonstrated a great deal of clustering of the ALGOL W 
commands. These clusters were based on the function of the 
commands and the flow of control processes in the program. 
These results suggest that the experts' knowledge structures 
were richer and more complex than the structures of the 
i n termed i ate or nov i ce programmers. 

In a related study, Adelson (1981) has found differences 
in memory capacity and the nature of the knowledge structures 
of expert and nov i ce programmers. When asked to recal 1 
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programs in the Polymorphic Programming Language <PPL) code, 
experts were able to recall more of the program than novices. 
Further, the findings suggest that while both groups of 
programmers had conceptual categories for the elements of PPL, 
the expert programmers seemed to have a more compl ex 
conceptual izat ion of the code. That is, the novice 
programmers' recal 1 s were i n the form of a syntax-based 
organization, while the expert programmers' recalls assumed a 
semantic, hierarchical organization based on principles of 
command functions. 

Research on the changes in ch i 1 dren's conceptual 
structures after instruction in a computer programming 
environment has just begun to emerge. For example, Pea & 
Kurland <1983) examined children's learning of programming 
syntax and comprehension of frequently-used subroutines. Their 
findings indicate that child programmers learn the basic syntax 
of the language and can construct simple, workable programs. 
However, they are not able to understand their own program's 
f 1 ow-of-control or more advanced concepts of recursion, 
conditionals, and variables. 

In an effort to investigate what children do learn in a 
programming environment, the purpose of this study was to 
examine how students of varying programming abilities recall 
and organ i ze programmi ng concepts. Specifically, this study 
focused on the recall of advanced, intermediate, and beginning 
adolescents for coherent and scrambled computer programs. The 
specific research questions were: 

1. What is the relationship between level of 
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programming expertise and the number of program 
lines and chunks recalled from coherent versus 
scrambled programs? 
2. What is the relationship between level of 

programming expertise and the nature of line and 
chunk recall in coherent versus scrambled 
programs? 

Design of the Study 

Setting 

The study was conducted during the second session of the 
Virginia Tech Computer Camp during the third and fourth weeks 
of July, 1985. 
Treatment 

On the first day of camp of the 12-day camp, all campers 
took a test on programming skills to facilitate placement in 
appropriate instructional groups: beginning, intermediate, and 
advanced. Begi nn i ng programmers were taught to program i n 
Applesoft BASIC. Intermediate programmers were taught advanced 
concepts in Applesoft BASIC (e.g., text files). Advanced 
programmers had a background in BASIC and were taught PASCAL, a 
more structured, procedurally organized language than BASIC. 
At all levels, instruction involved the learning of programming 
syntax and debugging. In addition, the instruction emphasizes 
structured programming skills which included thoughtful 
planning before beginning on-line programming and the use of 
top-down programming structure. A typical daily schedule 
included approximately 5 hours of instruction, 4 of which were 
spent in planning programs or on-line programming. 
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Participants 

From a pool of 50 campers, ages 10 to 18, 5 advanced 
programmers, 7 intermediate programmers, and 4 beginning 
programmers agreed to participate in the study. Four of the t6 
participants were girls, 12 were boys. This ratio of girls to 
boys was approximately the same for the camp as a whole. 
Participants ranged in age from 12 to 17. 
Mater i al s 

Two coherent and two scrambled versions of computer 
programs were constructed by the researchers and pilot testeo 
during the first session of the camp (i.e., first two weeks of 
July). These programs reflected the concepts that were taught 
in the beginning classes and reinforced in the intermediate and 
advanced classes. All programs were 16 to 18 lines long and 
written in the Applesoft BASIC programming language. Coherent 
programs were arranged in a structured, top-down programming 
style, which inherently organizes the program into procedural 

i 

chunks. Scrambled programs were constructed from programs 
consistinc of the same concepts as the coherent programs with 
the program lines rearranged to separate program lines that 
formed coherent procedural chunks. 
Data Col 1 ect i on 

Data was collected on the tenth day of the two-week 
session. All participants were gathered in one classroom and 
given a packet containing the four programs, four pieces of 
loose-leaf paper, and a pencil. The presentation of the 
to-be-remembered programs was randomly ordered for each 
participant. The par t i c i pants were instructed that they would 
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have two minutes to read the program and try to remember it as 
best as they could. At the end of two minutes, they were asked 
to close the program pach*: . ■ ; te on a clean piece of paper 
as much of the program as they could remember. The recall 
period lasted four minutes which appeared to be ample time for 
recall since all students completed their recall protocols, for 
scrambl ed and c oner en t programs, before the end of the 
four-minute time period. Between each trial the recall sheets 
were collected and the par t i c i p ants engaged in a br i ef 
di stractor task . 
Analysis of the Dat a 

The recal 1 sheets were scored for number of correc t 
programming lines. A program line was considered correct if 
the command was spelled correctly and subsequent syntax allowed 
the cist of the command line to be executed. For example, a 
recalled line would be counted correct if the original program 
line read, PRINT "Hi , I'm glad to meet you.", and the 
participant wrote, PRINT "Hi, nice to meet you." Also if the 
participant changed the variable letters in a loop (e.g., FOR I 
= 1 to 5; to FOR S =* 1 to 5), the program line was counted 
correct if all subsequent lines using the variable had assumed 
the correct variable assignment. The recall sheets were also 
scored for the number of procedural chunks recalled by the 
participants. Descriptive statistics were calculated for 
number of correct program lines and number of procedural chunks 
recalled. Two separate 3 x 2 x 2 split plot analyses of 
variance were conducted to assess group differences in line and 
chunk recall for each version of the coherent and scrambled 
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programs. Tukey tests were used to further investigate main 
effects within specific cells. Finally, protocols were 
qualitatively analyzed to examine the nature of the line and 
chunk recal 1 . 

Results and Discussion 
Table 1 reports the mean number of program lines and 
procedural chunks recalled across program conditions and 
groups. The raw data indicated that there was a difference in 
line and chunk recall according to program type (coherent 
versus scrambled). All three groups of programmers recalled 
more program lines and chunks from the coherent programs than 
from the scrambled programs. From the graphs of the raw data 
<see Figures 1 and 2), this difference was found to be 
especially apparent for the intermediate programmers. This 
finding may be due to the curricular emphasis at the computer 
camp, as well as the programmers' experience level. The 
advanced group, although familiar with the BASIC language, had 
been programming in PASCAL during their time at computer camp. 
These campers may not have been as able as the intermediate 
group to spontaneously apply schemata or templates for coherent 
models of BASIC programs. While both the intermediate and 
beginning programmers had been programming in BASIC, the 
intermediate group, with their increased level of experience 
and greater knowledge about what makes programs work, might 
have been more sensitive to the programs' level of coherence. 



Insert Table 1 and Figures 1 and 2 about here 
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The results of the two analyses of variance con-firmed 
these findings. There was a significant main effect for 
program version (coherent versus scrambled), for both line 
recall CF<1,13> = 7.22, p<.817] and chunk recall CF<1,13)« 
19.477, p<.881]. While not reaching an acceptable level of 
significance (i.e., p<.85> in either the line £F(2,13> ■ .497, 
p=NS] or chunk £F(2,13> » 3.358, p<.8663 recall analysis, an 
interaction between level of experience and program type can be 
seen in the graphs of the mean scores (see Figures i and 2). 
Contrary to the findings of Adelson (1981) and McKeithen et al . 
(1981), the advanced programmers did not recall the most 
program lines or chunks in the coherent condition. When 
compared with the advanced or beginning programmers, the 
intermediate programmers recalled more lines and chunks in the 
coherent condition, but not in the scrambled program condition. 
In the scrambled condition, the advanced programmers recalled 
more than the intermediates, who, in turn, recalled more than 
the beginners. 

This interaction may be explained in terms of the 
development of a specific versus general knowledge of 
programming. That is, the primary and, for the most part, sole 
programming language of the intermediate programmers was the 
BASIC language. During their ten days at camp, their expressed 
tasks had been to construct coherent programs that were 
procedurally organized, resulting in a specific knowledge of a 
programming language. The advanced group, on the other hand, 
had a working knowledge of BASIC and were expanding their 
knowledge of programming to another environment — PASCAL. 
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Thus, the advanced programmers were developing a broader, more 
general knowledge of programming. As a result, the advanced 
progra-nmers were able to use this general knowledge to organize 
procedural chunks in the scrambled, as well as the coherent, 
programs, while the intermediate programmers had to rely on 
their specific knowledge base of procedural 1 y-or i ented BASIC 
programs. 

Unlike the findings of McKeithen et al . <1?81), the 
beginning programmers showed significant differences in both 
the number of lines (p<.85> and chunks (p<.81> recalled in the 
coherent versus scrambled conditions. While the chunking 
findings may be suspect due to the narrow range of the raw 
scores (i.e., the beginners recalled only two procedural chunks 
from the coherent programs, and none from the scrambled 
programs), it seemed that, given the mean difference in line 
recall, they had developed a level of expertise that could 
differentiate coherent from scrambled programs. While this 
knowledge was not sufficient to enable the beginning 
programmers to recall as many lines or chunks as the more 
experienced programmers, they were able to demonstrate some 
degree of knowledge of the domain. 

Closer examination of the nature of the program line 
recall revealed two distinct patterns. First, the beginning 
programmers tended to group like commands together from the 
coherent, as well as the scrambled, programs. For example, one 
beginning programmer wrote all of the lines that called 
subroutines together (i.e., GOSUB 70, GOSUB 78, GOSUB 208). 
Another grouped all of the code that controlled the colors of 
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the programmed graphic <i.e., COLoR*V> £0l_0R»2, C0L0R=8>. 
Thus, while not forming chunKs r e l *\e<S to procedures, the 
beginners did organize their recall ac^rding to syntactic 
■features. The advanced programmers, Qr i the other hand, 
attempted to recall the scrambled, v^l 1 as the coherent, 

programs in a procedural order. That \p*Y seemed to try to 

make "sense" out of the scrambled p ro^r*M b * reordering some 
of the lines to produce workable procedural ^hunks. 

The second response pattern paralWl*^ the serial position 
effect reported by Rundus <\9?l> in h, ^ research on verbal 
rehearsal and word list recall. Hi * f I r^i ngs i nd i cated that 
the probability of recalling a word de&enq # d upon its position 
in a list. Typically, words appearing i n the beginning or at 
the end of the list were recalled m 0 r« c?*t*n than words in the 
middle of the list. Words *t the tje^jn^ng of the list were 
rehearsed more often than the other i^orq*, giving them a higher 
probability of being retrieved from ion^te r fl> memory. Words at 
the end of the lists were most recently s* e n , thus having them 
accessible in short-term memory. fhi* ^ame serial position 
effect was seen in beginning progr d mm.e r S' recall of coherent 
programs and all programmers' recall <?i ^crambled programs. 
These patterns of recall support the nOt'Qf, that the beginning 
programmers relied on a rote recall b^ed on syntactic or 
physical features for coherent an<a serried programs, while 
the more experienced programmers re^er t e q to this method only 
when they could not extract some meaning frcj^ the programs. 

These findings suggest that as d dOi events gain expertise 
in programming, they develop specific s che^ata *°r often-used 
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chunk* of programming procedures. Even the beginning 
programmers, who had only 18 day* of programming, demonstrated 
a rudimentary knowledge of the BASIC language and procedures. 
The findings from the intermediate group indicated that their 
proficiency as programmers was advanced enough to be successful 
when the program made sense, but fragile enough to falter when 
the program was incoherent. The advanced programmers seemed to 
be able to take a program, either coherent or scrambled, and 
order the lines Into meaningful procedural chunks that would 
allow the program to run successfully. Similar results were 
reported by McKelthen et al . <1?81> with adult programmers and 
Egan and Schwartz <1?7?) with electronics experts. In the 
McKelthen et al • study, the adult expert programmers added 
specific lines to the scrambled program in order to produce 
coherent nested loops and output sequences. In the Egan and 
Schwartz study, the electronics experts attempted to recall 
symbols systematically when the symbols of the electronics 
diagram were inappropriately positioned. 

Differences between advanced and intermediate programmers 
also became evident in their comments about the recall task. 
Only advanced programmers remarked about the incoherence of the 
scrambled programs and stated that those programs were more 
difficult to remember because "they didn't make any sense." 
Thus, the difference between intermediate and advanced 
programmers was seen in their desire to make the program "make 
sense" as well as in their confidence in declaring that 
something was wrong with the scrambled program. 

Cone 1 usions 
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The results of this preliminary investigation indicate 
that adolescents do learn and organize meaningful knowledge 
about computer programming much the same way that adults do. 
That is, those adolescents with more experience and practice 
develop a Itrvel of expertise that allows them to recall 
programs in terms of semantical 1y related procedures rather 
than syntactic or physical features. In terms of this study, 
even those participants who were not programming in BASIC on a 
day-to-day basis were still able to recall their prior 
experience with BASIC and the procedures that produce cer::-. 
programmed results. 

While the general i zabi 1 i ty of this study is restricted to 
this sample and setting, it is encouraging to see evidence that 
adolescents can acquire some levels of expertise in the area of 
programming. Continued study in this area will be able to 
reveal individuals' conceptual organization of this programming 
knowledge. Further investigation into the types of 

instructional environments and individual differences that are 
related to the acquisition of programming knowledge is also 
warranted. By assessing what children are learning in 
programming environments, we will be better able to ascertain 
the cognitive benefits of learning to program. 
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Tab It 1. Mean number of program I ines and procedura I chunks 
recoiled across condi t ions ond groups. 
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Coherent 


7.38 


0.63 


9.93 


2.36 


8.90 1 40 


<S0) 


2.20 


0.74 


4.25 


1.78 


3.78 1 .43 


Scrambled 


5.75 


0.00 


7.21 


0.14 


7.80 0.60 


<S0> 


1.28 


0.00 


2.42 


0.36 


1 .93 0.84 
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